Abstract 

The thermal stability and topology of non-canonical structures of G-quadruplexes and hairpins in template DNA were 
investigated, and the effect of non-canonical structures on transcription fidelity was evaluated quantitatively. We designed 
ten template DNAs: A linear sequence that does not have significant higher-order structure, three sequences that form 
hairpin structures, and six sequences that form G-quadruplex structures with different stabilities. Templates with non- 
canonical structures induced the production of an arrested, a slipped, and a full-length transcript, whereas the linear 
sequence produced only a full-length transcript. The efficiency of production for run-off transcripts (full-length and slipped 
transcripts) from templates that formed the non-canonical structures was lower than that from the linear template. G-quadruplex 
structures were more effective inhibitors of full-length product formation than were hairpin structures, even when the 
stability of the G-quadruplex in an aqueous solution was the same as that of the hairpin. We considered that intra- 
polymerase conditions may differentially affect the stability of non-canonical structures. The values of transcription 
efficiencies of run-off or arrest transcripts were correlated with stabilities of non-canonical structures in the intra-polymerase 
condition mimicked by 20 wt% polyethylene glycol (PEG). Transcriptional arrest was induced when the stability of the 
G-quadruplex structure (−ΔG°37) in the presence of 20 wt% PEG was more than 8.2 kcal mol⁻¹. Thus, values of stability in the 
presence of 20 wt% PEG are an important indicator of transcription perturbation. Our results further our understanding of 
the impact of template structure on the transcription process and may guide logical design of transcription-regulating 
drugs. 
Introduction 

Transcription is the first step in gene expression; it is highly 
regulated during both initiation and elongation. [1,2] Although 
fidelity of transcription elongation is critical for maintaining the 
accurate flow of genetic information, transcription elongation in 
cells and in vitro (Figure 1a) can be interrupted by certain sequences 
or structures. [3,4] For example, the RNA polymerase may slip 
back or forward on a template DNA during RNA synthesis at 
"slippage" sites such as the homopolymeric tract in human amyloid 
precursor protein gene. [2,5] In the amyloid precursor protein gene, the 
slippage results in a transcript 8- to 10-nt shorter or longer than 
the expected transcript, changing the coding capacity of mRNA 
(Figure 1b). [4,5,6] Moreover, the rate of transcription elongation is 
also dependent on sequence. The average rate of transcription 
elongation by RNA polymerase II on relatively unstructured DNA 
templates in the absence of any additional factors is ~5 
nucleotides s⁻¹ under optimal conditions. [7,8] At some template 
positions called "pause" sites such as the A-T-rich transactivation 
sequence in the HIV genome, the polymerase may halt for as long 
as 1 min before continuing transcription. The pause results in 
decreased mRNA production and therefore less protein production 
(Figure 1c). At other locations called "arrest" sites, which can 
be a damaged DNA residue or an A or T stretch, [5,6] a 
transcribing polymerase halts, the 3' end of the transcript is 
displaced from the active site of the polymerase, and the short 
transcript is released (Figure 1d). [9] These arrested transcripts may 
misfold or result in translation of mutant protein. Like non-coding 
RNAs such as miRNAs and antisense RNAs, [10,11,12,13,14] 
these short transcripts may also have roles in regulation of gene 
expression. Transcription is important not only in cellular systems 
but also in certain nano-materials such as logic devices. [15] If 
transcript mutations could be controlled, they might prove useful 
in medical, pharmaceutical, and materials sciences fields. 

We have shown that non-canonical structures such as triplexes, 
G-quadruplexes, and cruciform structures are stabilized drastically 
under cell-like conditions induced by cosolutes,[16] although a 
duplex is destabilized under the same conditions. [16,17] Recently, 
it has been reported that non-canonical G-quadruplex structures 
are present inside cells. [18] These results imply that non-canonical 

Sequence name   Sequence of X region (5'->3')
Linear          GCCGTTTCGTAGTATTTGGGTTGTMCTATCGAGG
H1              GCCGTTTCGTAGTATTTCTATCCGGACTTCGGTCC
H2              GCCGTTTCGTAGTCTATCGGACTTCGGTCCGATAG
H3              GCCGTCCAACTATCGGACTTCGGTCCGATAGTTGG
Q1              GCCGTTTCGTAGTATTTCGGTTAGGTTAGGTTAGG
Q2              GCCGTTTCGTAGTAGGGTTAGGGTTAGGGTTAGGG
Q3              GCCGTTTCGTGGGGTTAGGGGTTAGGGGTTAGGGG
Q4              GCCGTTTCGTAGTATTTCTAGGTTGGTGTGGTTGG
Q5              GCCGTTTCGTAGTATTGGGTTGGGTGTGGGTTGGG
Q6              GCCGTTTCGTAGGGGGTTGGGGTGTGGGGTTGGGG

Figure 1. Effects on RNA polymerase elongation by structures in template DNA (a-d) and illustration of the template DNA (e, f). 
(a) An unstructured template, (b) a template with a slippage site, (c) a template with a pause site, and (d) a template with an arrest site. (e) The region 
denoted by the box marked with an X contains the sequence designed to form a random coil or non-canonical structure. (f) Sequence names and 
sequences of X regions. Sequences expected to form non-canonical structures are highlighted by italic and bold. 
doi:10.1371/journal.pone.0090580.g001 

structures may play important roles in gene regulation. Addition- 
ally, we showed very recently that the formation of a G- 
quadruplex in an open reading frame (ORF) of mRNA suppresses 
its translation into protein. [19] It has been reported that 
quadruplex-forming sequences are enriched upstream and within 
gene promoters; structures formed in these regions may lead to 
transcription pausing via the inhibition of transcription initia- 
tion. [20] Moreover, hairpin structures formed by the nascent 
RNA may result in transcription pausing even if the hairpin does 
not form in template DNA. [21,22] Recent reports have indicated 
that transcription is arrested prior to stable structures in the DNA 
template such as a Z-form duplex, [23] triplex, [24,25] or 
G-quadruplex, [26,27] but there have been no quantitative and 
systematic reports on the effects of various non-canonical structures 
on arrest, slippage, or pause of transcription in terms of the 
thermodynamic stability of the non-canonical structures. 

The phenomena of arrest, pause, and slippage are ubiquitous 
aspects of transcriptional regulation. Arrest, pause, and slippage 
sites may result from unstable hybridization between template 
DNA and nascent RNA or upon formation of a structure in the 
DNA that interrupts and perturbs elongation. Although formation 
of a DNA/RNA hybrid can result in transcriptional interruption, [28] 
the mechanism of transcription interruption due to non-canonical 
structures may differ, because non-canonical structures 
like G-quadruplex have very different structural features and 
stabilities than DNA/RNA hybrids. Here, we evaluated the 
quantitative effects of non-canonical structures formed in the 
template DNA on transcription catalysed by T7 RNA polymerase. 

Results 

Structure and stability of non-canonical structures 

Hairpins (template region sequences of cruciform structure) and 
G-quadruplexes are known to form in template DNA. We 
designed and synthesized ten different template DNAs 
(Figures 1e, 1f and Table S1 in File S1) in order to evaluate the 
effect of formation of hairpins and G-quadruplexes with different 
thermal stabilities on transcription elongation. The control 
sequence (Linear) should not form significant structure. Sequences 
H1 to H3 and Q1 to Q6 were designed to form hairpins or 
G-quadruplexes, respectively, at a site 35 bases downstream from the 
T7 promoter region as shown in Figure 1e. The G-quadruplex-forming 
sequences are based on the human telomeric sequence 
and the thrombin DNA aptamer sequence and have different 
numbers of G-quartets (Figure 1f). [16] To confirm the formation 
of the non-canonical structures in the template DNAs, 
oligonucleotides containing only the non-canonical structure region 
(linear, h1 to h3, and q1 to q6) were synthesized (Table S2 in 
File S1). We measured circular dichroism (CD) spectra of these 
DNA oligonucleotides at 37°C (Figure S1a in File S1) and also 
analyzed each by native gel electrophoresis (Figure S1b in File S1). 
These analyses indicated hairpin formation by h1, h2, and h3, and 
G-quadruplex formation by oligonucleotides q1-q6 (Table 1 and 
Figure S1c in File S1). Moreover, formation of G-quadruplexes in 
the template DNAs for Q1-Q6 was confirmed by fluorescence 
analysis using protoporphyrin IX, which binds specifically to 
G-quadruplexes and produces fluorescence (Figure S2 in File 
S1). [19,29] 
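
The link between a G-rich sequence and the number of G-quartets it can stack can be illustrated with a minimal sketch. It assumes a deliberately crude rule that is not the authors' method: a quadruplex needs four G-tracts, and the fourth-longest tract limits the number of quartets. The function names are hypothetical, and the two cores are the G-rich 3' portions of the Q2 and Q6 X regions.

```python
import re


def g_tracts(seq: str, min_len: int = 2) -> list[str]:
    """Return every run of at least min_len consecutive G's in seq."""
    return re.findall(r"G{%d,}" % min_len, seq.upper())


def putative_quartets(seq: str) -> int:
    """Crude upper bound on stacked G-quartets (illustrative assumption).

    Four G-tracts are required; the fourth-longest tract sets the bound.
    Returns 0 when fewer than four tracts are present.
    """
    lengths = sorted((len(t) for t in g_tracts(seq)), reverse=True)
    if len(lengths) < 4:
        return 0
    return lengths[3]


# G-rich cores copied from the Q2 and Q6 X regions (Figure 1f).
q2_core = "GGGTTAGGGTTAGGGTTAGGG"
q6_core = "GGGGTTGGGGTGTGGGGTTGGGG"

print(putative_quartets(q2_core))  # 3
print(putative_quartets(q6_core))  # 4
```

Such a count says nothing about topology or stability; those were established here by CD, gel electrophoresis, and melting analysis.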



The thermodynamic parameters for the formation of hairpins 
and G-quadruplexes were obtained from their thermal melting 
curves by monitoring UV absorbance at 295 or 260 nm 
(Figure 2a). [16] The mid-points of the thermal melting transitions 
(Tm values) and thermodynamic parameters for the structure 
formation are given in Table 1 and Table S3 in File S1. The 
values of Tm and −ΔG°37 (the stability at 37°C) for h1, h2, and h3 
increased with increasing stem length (4, 9, and 13 base pairs, 
respectively). The Tm values for G-quadruplexes based on the 
human telomeric sequence, q1, q2, and q3, were 37.3, 62.9, and 89.8°C, 
respectively. Those for G-quadruplexes q4, q5, and q6 based on 
the thrombin aptamer sequence were 46.8, 80.5, and >95°C, 
respectively. In both cases, the Tm values increased with the 
number of G-quartet stacks. The values of −ΔG°37 for the 
G-quadruplex-forming oligonucleotides also increased with increasing 
number of G-quartets. The G-quadruplexes formed by q4, 
q5, and q6 were much more stable than those formed by q1, q2, 
and q3 due to differences in stacking interactions of loop regions 
(Figure S1c in File S1). These thermal analyses indicated that all 
the non-canonical structures should form in a template DNA at 
37°C. 
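
How a melting temperature relates to a free energy at 37°C can be sketched under a simple model. This is an illustration, not the fitting procedure used in this study. It assumes an intramolecular two-state transition with a temperature-independent enthalpy, so that ΔG°(T) = ΔH°(1 − T/Tm). The ΔH° value in the example is a hypothetical placeholder, and only the Tm of 55.5°C is taken from Table 1.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal mol^-1 K^-1


def delta_g(delta_h: float, tm_celsius: float, t_celsius: float = 37.0) -> float:
    """Folding free energy (kcal/mol) at t_celsius.

    Assumes an intramolecular two-state transition and a
    temperature-independent enthalpy delta_h (kcal/mol).
    """
    tm = tm_celsius + 273.15
    t = t_celsius + 273.15
    return delta_h * (1.0 - t / tm)


def fraction_folded(delta_h: float, tm_celsius: float, t_celsius: float) -> float:
    """Folded fraction from K = exp(-dG/RT) for the same two-state model."""
    t = t_celsius + 273.15
    k = math.exp(-delta_g(delta_h, tm_celsius, t_celsius) / (R_KCAL * t))
    return k / (1.0 + k)


# Hypothetical enthalpy (placeholder, not a value from the paper),
# combined with the Tm reported for h1 in Table 1.
dh_example = -50.0
print(delta_g(dh_example, 55.5))                # negative: folded state favoured at 37 °C
print(fraction_folded(dh_example, 55.5, 55.5))  # 0.5 at the melting temperature
```

In this model the free energy vanishes at Tm and becomes more negative as Tm rises above 37°C. That is the sense in which a higher Tm corresponds to a larger −ΔG°37 at fixed enthalpy.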

Effect of non-canonical structures on transcription 

T7 RNA polymerase transcription of the linear template under 
multi-turnover conditions was almost saturated at 90 min (data 
not shown). Inhibitory effects of non-canonical structures in the 
template DNA on transcription were estimated from the amount 
of transcript (product RNA) formed at this time point. Figure 2c 
shows the results of gel electrophoretic analysis of transcription 
carried out for 90 min at 37°C under multi-turnover conditions. 
RNA size was determined by analysis of samples of each reaction 
in parallel with size markers and a 35-nt RNA on a denaturing 
polyacrylamide gel (Figure 2c). When the template with no 
significant structure was used, the transcription proceeded to the 
end of the DNA template, resulting in the formation of a full- 
length transcript of 70 nt (Figure 2c, lane 3). To confirm that the 
Linear template produced mainly full-length transcript, we carried 
out additional experiments to quantitate formation of longer and 
shorter products from this template (Figure S3 in File S1 and 
Methods section). Interestingly, transcription of all template DNAs 
able to form non-canonical structures resulted in products in 
addition to the full-length transcript. Transcription 
of templates H1, H2, Q1, Q2, Q4, and Q5 yielded transcripts 
approximately 10 nt longer than the full-length transcript 
(Figure 2c, lanes 4, 5, 7, 8, 10, and 11). In contrast, transcripts 
from the reaction with H3, which formed the most stable hairpin, 
contained a minor product band that migrated at approximately 
60 nt, about 10 nt shorter than the full-length transcript (Figure 2c, 
lane 6). Previous studies indicate that elongating RNA polymerase 
can slip to produce transcripts longer or shorter by 8 to 10 nt. 
Thus, formation of non-canonical structures appears to induce 
slippage. The template DNAs Q3, Q5, and Q6, which are able to form 
very stable G-quadruplexes (−ΔG°37 values of 14.3 kcal mol⁻¹ or more 
in Table 1), induced production of a transcript of about 
35 nt (Figure 2c, lanes 9, 11 and 12). This length suggests that T7 
transcription was arrested at the G-quadruplex structure. Under 
the conditions used here, the hairpins induced slippage, whereas 
G-quadruplexes induced slippage and arrest, with extent of arrest 
depending on the G-quadruplex stability. We also carried out 
transcription using template DNA in the presence of the complementary 
strand (Figures S4a and S4b in File S1). Template DNAs 
Q3, Q5, and Q6 induced production of the arrested transcript 
even in the presence of complementary DNA (Figure S4c in File 
S1). Thus, our model systems revealed the correlation between the 
stability of structures adopted by the template and the transcrip- 
tion efficiency. 

Amount of non-full length transcript depends on the 
stability of non-canonical structures 

To understand how structural stability affected the production 
of slipped and arrested transcripts, we further analysed transcrip- 
tion from template Q5, which resulted in both slipped and arrested 
products (Figure 2c, lane 11). We evaluated transcription in buffer 
with KCl concentrations ranging from 0 to 70 mM (Figure 3), 
because the stability of G-quadruplex strongly depends on K+ 
concentration. [30,31,32] UV measurements showed that the −ΔG°37 
values of q5 increased with increasing KCl concentration (Table 
S4 in File S1). In the absence of KCl, the −ΔG°37 value for q5 
was −0.1 kcal mol⁻¹, and slipped, but no arrested, product was 
produced from template Q5 (Figure 3a, lane 3). In the presence of 
10 to 70 mM KCl, the arrested transcript was observed (Figure 3a, 
lanes 4 to 7, red arrows). The amount of slipped transcript 
decreased and the amount of arrested transcript increased with 
increasing KCl concentration. The transcription experiments were 
also performed in various concentrations of LiCl, as G-quadruplexes 
are significantly destabilized in the presence of Li+. [33] 
Only slipped transcript was produced in the presence of LiCl 
(Figure 3b, blue arrows). Thus, slippage was induced even 
when the G-quadruplex was unstable, and increasing amounts of 
arrest were observed as the non-canonical structure was stabilized. 
Only full-length transcript was produced from the Linear template in 
the presence of KCl or LiCl (Figure S5 in File S1). 

To confirm that the production of slipped and arrested 
transcripts was caused by formation of a G-quadruplex, transcription 
experiments were also performed in the presence of the 
G-quadruplex-stabilizing ligands 5,10,15,20-tetrakis(N-methyl-4-pyridyl)porphyrin 
(TMPyP4), berberine, and N-methyl mesoporphyrin 
IX (NMM). [34,35,36] As we expected, more of the 
arrested transcript was observed for template Q5 in the presence 
of each ligand than without ligand in 3 mM KCl (Figure S6 in File 
S1). The stability of the q5 structure was significantly enhanced in the 
presence of each ligand (Table S5 in File S1). We also 
investigated transcription from different template DNAs (Q3, Q6, 
and the platelet-derived growth factor B (PDFB) sequence, [37] 
which contains a G-quadruplex-forming sequence). In all cases, the 
presence of G-quadruplex-forming sequences induced transcription 
arrest (Table S5 in File S1). 

Quantitative correlation between the stability of non- 
canonical structures and transcription efficiency 

To quantify the effect of the non-canonical structure on the 
transcription, we estimated the correlation between the stability of 
non-canonical structures and transcription efficiency (TE). The 
full-length and slipped transcripts were defined as "run-off 
transcripts" (TErun-off) because the RNA polymerase travels to 
the end of template DNA when these products are produced 
(Figure 1b). The TE of each transcript (%) was calculated as the 
ratio of the gel band intensity for that transcript from a 
structured template DNA to the band intensity of the full-length 
transcript from the Linear template DNA (Table S6 in File S1). 
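
The definition of transcription efficiency above can be written out as a short sketch. The function names and band intensities are hypothetical placeholders. The 4% cutoff for calling arrest follows the footnote to Table 1. How intensities are extracted from the gel is outside the scope of this sketch.

```python
def transcription_efficiency(band_intensity: float, linear_full_length: float) -> float:
    """TE (%) = band intensity relative to the full-length band from the Linear template."""
    if linear_full_length <= 0:
        raise ValueError("reference intensity must be positive")
    return 100.0 * band_intensity / linear_full_length


def summarize_lane(full_length: float, slipped: float, arrested: float,
                   linear_full_length: float, arrest_cutoff: float = 4.0) -> dict:
    """Combine band intensities from one lane into run-off and arrest TEs.

    Run-off transcripts are the full-length plus slipped products; arrest is
    called when the arrested product exceeds arrest_cutoff percent.
    """
    te_run_off = transcription_efficiency(full_length + slipped, linear_full_length)
    te_arrest = transcription_efficiency(arrested, linear_full_length)
    return {
        "TE_run_off": te_run_off,
        "TE_arrest": te_arrest,
        "arrest": te_arrest > arrest_cutoff,
    }


# Hypothetical band intensities in arbitrary units (not measured values).
lane = summarize_lane(full_length=400.0, slipped=75.0, arrested=300.0,
                      linear_full_length=1000.0)
print(lane)
```

Because every lane is normalized to the same Linear reference, the run-off and arrest efficiencies of one template need not sum to 100%.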



Table 1. The stabilities of non-canonical structures designed to form in template DNA.(a)

Sequence(b)   Structure                   Tm(c) (°C)   −ΔG°37 (kcal mol⁻¹)   Arrest(d)
H1 (h1)       hairpin                     55.5         2.7 ± 0.3             No
H2 (h2)       hairpin                     74.5         7.2 ± 0.4             No
H3 (h3)       hairpin                     82.4         12.7 ± 0.2            No
Q1 (q1)       antiparallel G-quadruplex   37.3         0.1 ± 0.1             No
Q2 (q2)       (3+1) mixed G-quadruplex    62.9         3.7 ± 0.2             No
Q3 (q3)       (3+1) mixed G-quadruplex    89.8         n.d.(e)               Yes
Q4 (q4)       antiparallel G-quadruplex   46.8         1.4 ± 0.2             No
Q5 (q5)       parallel G-quadruplex       80.5         14.3 ± 0.3            Yes
Q6 (q6)       antiparallel G-quadruplex   >95          n.d.(e)               Yes

(a) All experiments were carried out in a buffer containing 30 mM KCl, 40 mM Tris-HCl (pH 8.0), 8 mM MgCl2, and 2 mM spermidine. 
(b) The sequences of template DNAs are shown in Figure 1f and Table S1 in File S1. The sequences designated by lower case letters contain only the non-canonical structure region (see Table S2 in File S1). 
(c) The melting temperature was determined at a strand concentration of 2 µM. 
(d) Arrest was defined as more than 4% production of arrested product RNA. 
(e) The −ΔG°37 value could not be determined because of very high stability. 
doi:10.1371/journal.pone.0090580.t001 

The TErun-off values for H1, H2, and H3 in 30 mM KCl 
solution were 87.3, 84.8, and 76.7%, respectively; thus, the 
transcription efficiency decreased with increasing hairpin stability 
(Table 1 and Table S6 in File S1). The TErun-off values for Q1, Q2, 
and Q3 in 30 mM KCl solution were 98.0, 66.9, and 16.2%, and those 
for Q4, Q5, and Q6 were 99.3, 47.5, and 4.2%, respectively 
(Table S6 in File S1). Moreover, the TErun-off values for Q5 in 10 
and 50 mM KCl solution were 68.5 and 45.0%, respectively 
(Table S4 in File S1). Thus, for these G-quadruplex-forming 
templates the transcription efficiency decreased with increasing 
G-quadruplex stability. In general, the TErun-off should not depend 
on DNA sequence because the rate-determining step of transcription 
in multi-turnover conditions is the initiation step (that is, RNA 
polymerase binding to the promoter region). The TErun-off for 
templates with non-canonical structures clearly decreased depending 
on DNA sequence, which indicated that the pause induced by 
the non-canonical structures was the rate-determining step. For 
the hairpin-forming templates, H1, H2, and H3, the values of 
TErun-off decreased with increasing values of −ΔG°37 (Figure 
S7a in File S1). The values of TErun-off for Q1, Q2, Q4, and Q5 
also decreased with increasing −ΔG°37 (Figure S7a in File S1), but 
the decrement depended on the sequence and topology of the 
G-quadruplex. 

We also estimated the correlation between the stability of G- 
quadruplex and transcription efficiency for arrested transcript 
(TEarrest). Interestingly, we found a good correlation between the 
stability of G-quadruplex structures in the template DNA and the 
TEarrest (Figure S7b in File S1). The values of TEarrest increased 
with increasing stability of the G-quadruplex. We hypothesize that 
the polymerase cannot unwind a stable G-quadruplex, and, 
therefore, the polymerase stalls. The arrested transcripts are 
produced when the polymerase stalls long enough to dissociate 
from the template. 



PLOS ONE | www.plosone.org | March 2014 | Volume 9 | Issue 3 | e90580 



Perturbation of Transcription Fidelity 




[Figure 2 graphic: panels (a) and (b) plot normalized absorbance against temperature (°C) from 20 to 100°C; panel (c) is a gel with lanes 1 to 12 and size markers labelled 70, 50, and 30, with run-off and arrested transcripts marked.] 


Figure 2. UV melting curves for oligonucleotides containing sequences in DNA templates (a, b) and effects of non-canonical 
structures in the template DNA on transcription (c). Normalized UV melting curves for 2 µM (a) h1 (blue), h2 (pink), and h3 (green), (b) q1 
(blue), q2 (pink), q3 (green), q4 (red), q5 (black), and q6 (purple) in a buffer containing 30 mM KCl, 40 mM Tris-HCl (pH 8.0), 8 mM MgCl2, and 2 mM 
spermidine. The Tm values for h1, h2, h3, q1, q2, q3, q4, q5, and q6 were 55.5, 74.5, 82.4, 37.3, 62.9, 89.8, 46.8, 80.5, and >95°C, respectively. (c) Denaturing gel 
electrophoresis of products of transcription reactions carried out for 90 min at 37°C. Reaction mixtures contained 0.3 µM T7 polymerase and 1.5 µM 
DNA template in a buffer containing 30 mM KCl, 40 mM Tris-HCl (pH 8.0), 8 mM MgCl2, and 2 mM spermidine. Lane 1, size marker; lane 2, 35-nt RNA; 
lanes 3 to 12, transcription products for Linear, H1, H2, H3, Q1, Q2, Q3, Q4, Q5, and Q6 templates, respectively. Blue and red arrows indicate the 
slipped and arrested transcripts, respectively. 
doi:10.1371/journal.pone.0090580.g002 



Correlation between the stability of non-canonical 
structures in the presence of polyethylene glycol and 
transcription efficiency 

Intracellular environments are highly crowded with various 
biomolecules; therefore, in vitro studies under molecular crowding 
conditions provide important information on how biomolecules 
behave in cells. [16,38,39] We previously showed that the 
structures of DNA immobilized on a gold surface are totally 
different from those in dilute bulk solution due to molecular 
crowding of immobilized DNAs. [40] Moreover, the presence of 
discrete nanoscale spaces such as those inside micelles and reverse 
micelles induces molecular crowding and alters the stability and 
structures of biomolecules. [41,42] During transcription, the 
template DNA duplex is taken into the active site of RNA 
polymerase; therefore, the environment surrounding the DNA is very 
different from a dilute solution. We hypothesized that the 
polymerase essentially imposes crowding conditions on the 
template DNA and therefore alters the stability of the structures 
adopted by the template. We measured the values of −ΔG°37 for 
the non-canonical structures formed by the templates in the 



molecular crowding condition induced by 20 wt% polyethylene 
glycol of molecular weight 200 (PEG 200). [16,38,39] As expected 
based on previous studies, [16,17] hairpins formed by h1, h2, and 
h3 were destabilized by addition of 20 wt% PEG 200. In contrast, 
the G-quadruplexes formed by q1, q2, q4, and q5 were stabilized, 
although the stabilities of q3 and q6 were too high to allow 
accurate determination of thermodynamic parameters. In principle, 
formation of DNA structures is accompanied by the formation 
of a hydrogen-bonding network of water surrounding the DNA 
surface that is highly sensitive to the water activity of the 
solution. [16,17,43] A decline in the water activity in the presence 
of PEG 200 disfavors formation of structures such as hairpins that 
require uptake of water molecules, and favors formation of 
structures such as G-quadruplex that are accompanied by the 
release of water. [16,43] 

We estimated the correlation between the stability of non- 
canonical structures in the presence of 20 wt% PEG 200 and 
TErun-off (Figure 4a). The values of TErun-off decreased with 
increasing stability of non-canonical structures in the presence of 
20 wt% PEG 200. The decrement of TErun-off was not dependent 
on the topology of the non-canonical structures, although values of 
TErun-off seemed to depend strongly on the topology of non-canonical 
structures in the absence of PEG. Furthermore, the amount of 
arrested product increased linearly with increasing stability of 
non-canonical structures in the presence of 20 wt% PEG 200 
(Figure 4b). Thus, the −ΔG°37 values for the non-canonical 
structures in the presence of PEG are reflective of the stabilities of 
the non-canonical structures during transcription. 
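
A linear dependence of arrested product on stability implies a threshold stability: the x-intercept of the fitted line, below which no arrest is expected. The sketch below shows an ordinary least-squares fit and its x-intercept. The data points are synthetic placeholders, not measurements from this study. They were chosen to lie on a line that crosses zero at 8.2 kcal mol⁻¹, the threshold quoted in the abstract.

```python
def fit_line(xs: list[float], ys: list[float]) -> tuple[float, float]:
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need at least two paired points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def x_intercept(slope: float, intercept: float) -> float:
    """Stability at which the fitted arrest efficiency extrapolates to zero."""
    return -intercept / slope


# Synthetic placeholders: -dG(37) in 20 wt% PEG 200 (kcal/mol) versus
# TE_arrest (%), generated from y = 5 * (x - 8.2). Not experimental data.
stability = [9.0, 10.0, 12.0, 14.0]
te_arrest = [4.0, 9.0, 19.0, 29.0]

slope, intercept = fit_line(stability, te_arrest)
threshold = x_intercept(slope, intercept)
print(round(threshold, 2))  # 8.2
```

With real data the intercept carries the uncertainty of the fit, so the threshold is better reported with a confidence interval than as a single number.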

Discussion 

During transcription, pause or arrest may result from unstable 
hybridization between template DNA and nascent RNA or upon 
formation of a structure in the DNA that interrupts and perturbs 
elongation. The decrease of the transcription efficiency in the 
templates studied here results from a halt in transcription 
elongation at structures formed in the DNA templates. Significantly 
lower transcription efficiencies (as measured based on the 
amount of full-length transcript produced) were observed with 
templates able to form G-quadruplex structures relative to 
templates able to form hairpins (Figure 2c and Table S6 in File 
S1). In addition, transcription arrest was induced only by stable 
G-quadruplexes (Figures 2c, 3 and S6 in File S1, and Tables S4, S5, 
and S6 in File S1). Thus, G-quadruplex structures are more 
effective inhibitors of elongation than are hairpin structures, as 
suggested by the correlation between transcript production and 
the stability of structures formed by oligonucleotides of the same 
sequence (Figure S7 in File S1). 

Our experiments in the presence of PEG 200 indicate that the 
conditions within the polymerase active site might be more similar 
to those induced by molecular crowding agents than to those in an 
aqueous buffer. The dielectric constant is also likely low in the 
region of the polymerase active site: there are several reports 
that the dielectric constant is very low (less than 10) inside proteins 
such as RNase T1. [44] We and other groups have shown that 
molecular crowding conditions stabilize G-quadruplexes and 
destabilize hairpins. [16,39,45] Moreover, G-quadruplexes are 
stabilized when the dielectric constant is low, whereas duplexes 
are unchanged or slightly destabilized. Therefore, the stabilities of 
the G-quadruplexes during transcription could be higher than 
the stabilities estimated from our thermal melting analyses in the 
absence of PEG 200, and those of hairpins could be lower. 
Additionally, Hoogsteen base pairs are stabilized under 
molecular crowding conditions. [16,39] Thus, the PEG 200 added 
as the molecular crowding inducer may mimic not only molecular 
crowding but also intra-polymerase conditions. We observed 
correlations between the values of TErun-off or TEarrest and 
stabilities of non-canonical structures in the presence of 20 wt% 
PEG 200 (Figure 4). 

A model based on the transcription elongation complex (TEC) 
that consists of RNA polymerase, template DNA, and nascent 
RNA at an abasic site suggests that transcription can be 
interrupted by formation of a hybrid between template DNA 
and nascent RNA. [8,46,47] We hypothesize that the mechanism 
of transcription interruption due to non-canonical structures 
differs from this model. When the polymerase encounters a 
non-canonical structure that is not readily denatured, the TEC 
will be destabilized. Due to the destabilization of the TEC, 
the polymerase may slip backward or forward on the template 
DNA. Alternatively, the TEC may dissociate. Transcriptional 
arrest was induced when the stability of the structure (−ΔG°37) in 
the presence of 20 wt% PEG 200 was more than 8.2 kcal mol⁻¹ 
(the x-intercept value in Figure 4b), indicating that the polymerase 
is unable to melt structures more stable than 8.2 kcal mol⁻¹. Patel 
and co-workers determined experimentally that the observed free 
energy changes for opening (melting) the template DNA duplex 
(−ΔGopen) in the complex of T7 polymerase and template DNA in 
the promoter during transcription initiation were in the range of 
7.0 to 8.0 kcal mol⁻¹, different from the value (~6 kcal 
mol⁻¹) predicted by nearest-neighbour 
analysis under physiological ionic conditions. [48] Interestingly, 
the −ΔGopen value from Patel's report is similar to the value we 
estimate based on G-quadruplex stability. Thus, stable 
non-canonical structures might stall the polymerase long enough to 
result in its dissociation from the template. Our estimated values of 
non-canonical structure stability in the presence of 20 wt% PEG 200 are an 
indicator of the stability necessary to result in transcription 
perturbation by the non-canonical structures. 

Pause and slippage are known to occur in homopolymeric A/T 
or C/T tracts in the template DNA [2] and can lead to production 
of mutant protein. It was reported that polymerase backtracking 
appears to pass through a paused, transcriptionally inactive 
intermediate state. [28,49] The molecular basis for backtracking 
is not clear, but it is correlated either with mispairing[50,51,52] or 
very weak pairing of the 3' end of the transcript and the 
template. [49,53] Current models require that the RNA-DNA 
hybrid at the point of arrest be very weak to initiate upstream 
translocation. [50] During transcription elongation, structures 

Figure 3. Effect of the G-quadruplex stability on the production of slipped and arrested transcripts. Denaturing gel electrophoresis of 
products of transcription reactions carried out for 90 min at 37°C using Q5 template. Reaction mixtures contained 0.3 µM T7 polymerase and 1.5 µM 
DNA template in a buffer containing 40 mM Tris-HCl (pH 8.0), 8 mM MgCl2, and 2 mM spermidine and various concentrations of (a) KCl or (b) LiCl. 
Lane 1 shows 10-nt size marker, lane 2 shows 35-nt RNA, and lanes 3 to 7 show transcription products in the presence of 0, 10, 30, 50, and 70 mM (a) 
KCl or (b) LiCl. Blue and red arrows indicate the slipped and arrested transcripts, respectively. 

formed by the template DNA had not been thought to induce 
slippage, although G-quadruplexes have been shown to induce 
transcription arrest and pause. [27,46] It has also been shown that 
G-quadruplex formation in a promoter region of template DNA 
may inhibit transcription initiation. It was also suggested that a 
purine tract can induce slippage, but existing models do not account for 
this effect, since weak hybridization at the 3' end of purine-containing 
transcripts is not necessary to provoke arrest. [54] We 
hypothesize that these purine tracts form G-quadruplex structures. 
Our data imply that the effects resulting from the presence of 
non-canonical structures depend on the energy necessary to dissociate 
the non-canonical structures. If the polymerase encounters a 
non-canonical structure at least 8 to 10 nt upstream of a slippery 
sequence, the polymerase may slip and return to the transcriptionally 
competent position (resulting in a long mutated transcript) 
or may dissociate (resulting in an arrested protein product). Hence, 
our findings suggest another mechanism for mutation induced by 
non-canonical structures. 

A purine tract found in the HIV-1 sequence [55] and 
trinucleotide repeat (TNR) sequences [56] impact polymerase 
elongation. It was also shown that the repeat sequence (CG)14 
induces large deletions in transcripts in mammalian cells. [57] The 
effects of these sequences are likely caused by non-canonical 
structures such as Z-form duplexes, G-quadruplexes, and hairpins 
(cruciforms). The molecular crowding conditions present inside living 
cells vary with the cell cycle, and non-canonical structure 
formation will thus be regulated by conditions such as the concentration 
of cosolutes and cations. [38] Our observations suggest how 
slipped and arrested transcript formation is controlled by 
conditions inside cells from the viewpoint of stability of non- 
canonical structures at the energy level. 

The regulation of transcription elongation impacts the expres- 
sion of many eukaryotic and viral genes. Our results indicate that 
transcriptional arrest by G-quadruplex structures within the 
template DNA depends on the stability of the structures formed. 
Moreover, the transcription arrest could be induced by the 
addition of ligands that bind to and stabilize G-quadruplexes. 
These ligands have potential as drugs for cancer treatment since 
they inhibit gene expression via transcription regulation. Our 
results further our understanding of the impact of template 
structure on the transcription process and may guide design of 
transcription-regulating drugs. 

Conclusions 

We evaluated the quantitative effect of the non-canonical 
structures (G-quadruplexes and hairpins) in template DNA on 
transcription. Previously, it had been thought that G-quadruplexes 
block elongation. Our results indicated quantitatively that 
transcriptional arrest, pause, and slippage can be induced by 
G-quadruplexes and hairpins and that the effect is 
dependent on the structure formed and its thermal stability. We 
speculate that G-quadruplex structures are more effective inhibitors 
of full-length product formation than are hairpin structures 
because the intra-polymerase conditions preferentially stabilize the 
G-quadruplex. Thus, the perturbation of transcription fidelity due 
to non-canonical structures can be regulated by the stability of the 
structures formed in the polymerase. 

Materials and Methods 

Materials 

All oligodeoxynucleotides used in this study were purified by 
high-performance liquid chromatography (Japan Bio Service). 
Single-strand concentrations of DNA oligonucleotides were 
calculated from the absorbance measured at 260 nm and 80°C 
using single-strand extinction coefficients calculated from the 
mononucleotide and dinucleotide data according to the nearest- 
neighbor approximation model. 1 The absorbance was measured 
using a Shimadzu 1700 spectrophotometer connected to a 
thermoprogrammer. The concentration of T7 polymerase was 
determined from its absorbance at 280 nm by using 
ε280 = 1.4 × 10⁵ M⁻¹ cm⁻¹. [58] 
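
The concentration determinations described above follow the Beer-Lambert law, c = A / (ε · l). As a minimal sketch, assuming a 1 cm path length and taking ε280 of T7 polymerase as 1.4 × 10⁵ M⁻¹ cm⁻¹ (our reading of the value cited above), the calculation could look like the following. The function name and the absorbance reading are hypothetical and are not taken from the study.

```python
def concentration_molar(absorbance, epsilon, path_cm=1.0):
    """Beer-Lambert law: c = A / (epsilon * l), returned in mol/L."""
    if epsilon <= 0 or path_cm <= 0:
        raise ValueError("epsilon and path length must be positive")
    return absorbance / (epsilon * path_cm)


# Assumed extinction coefficient of T7 polymerase at 280 nm (M^-1 cm^-1).
EPSILON_T7_280 = 1.4e5

# Hypothetical absorbance reading, chosen only for illustration.
a280 = 0.042

c = concentration_molar(a280, EPSILON_T7_280)
print(f"{c * 1e6:.2f} uM")  # prints "0.30 uM"
```

The same function applies to the DNA oligonucleotides if the nearest-neighbor extinction coefficient at 260 nm is supplied in place of the protein value.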
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Figure 4. Correlation between −ΔG°37 values obtained in 20 wt% PEG200 and transcription efficiency (TE). (a) TE of run-off transcripts 
and (b) TE of arrested transcript. Reaction mixtures contained 0.3 µM T7 polymerase and 1.5 µM DNA template in a buffer containing 40 mM Tris-HCl 
(pH 8.0), 8 mM MgCl2, and 2 mM spermidine, and reactions were incubated for 90 min at 37°C. 
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Circular dichroism (CD) measurements 

CD measurements were made on a JASCO J-820 spectropolarimeter 
at 2 µM total DNA strand concentration in buffers 
containing 30 mM KCl, 40 mM Tris-HCl (pH 8.0 at 37°C), 
8 mM MgCl2, and 2 mM spermidine. The spectra at 37°C were 
obtained by taking at least three scans from 200 to 350 nm in a 
cuvette with a pathlength of 0.1 cm. The temperature of the cell 
holder was regulated by a JASCO PTC-348 temperature 
controller, and the cuvette-holding chamber was flushed with a 
constant stream of dry N2 gas to avoid condensation of water on 
the cuvette exterior. Before the measurement, the samples were 
heated to 95°C, cooled at a rate of −1°C min⁻¹, and incubated at 
37°C for 30 min. 

Gel electrophoresis 

Native gel electrophoresis was carried out on 10% nondenaturing 
polyacrylamide gels in buffer containing 30 mM KCl, 40 mM 
Tris-HCl (pH 8.0 at 37°C), 8 mM MgCl2, and 2 mM spermidine 
at 37°C. Loading buffer (1 µL of 40% glycerol and 1% blue 
dextran) was mixed with 2 µL of 2 µM DNA sample. Gels were 
stained with SYBR® Gold (PerkinElmer Life Sciences) and imaged 
using a fluorescent imager (FUJIFILM, FLA-5100). Before the 
measurement, the samples were heated to 95°C, cooled at a rate of 
−1°C min⁻¹, and incubated at 37°C for 30 min. Unstructured 
DNAs of 12, 15, 25, and 30 nt were electrophoresed in parallel. 
When a hairpin or G-quadruplex is formed, the DNA 
should migrate faster than unstructured DNA of the same length. 
All template DNAs designed to form hairpins migrated faster than 
the 12-nt unstructured DNA, and the templates 
designed to form G-quadruplexes migrated faster than the 
unstructured DNA of 15 nt, indicating that all DNA sequences 
formed hairpins or G-quadruplexes. 

Fluorescence measurement of protoporphyrin IX (PPIX) 

Template DNAs (1 µM) were mixed with PPIX (1 µM) in buffer 
containing 30 mM KCl, 40 mM Tris-HCl (pH 8.0 at 37°C), 
8 mM MgCl2, and 2 mM spermidine for 30 min. Fluorescence 
intensity of PPIX was measured at 37°C using a spectrofluorometer 
(JASCO, F6500) with 400 nm excitation and 
630 nm emission. Before the measurement, the samples were 
heated to 95°C, cooled at a rate of −1°C min⁻¹, and incubated at 
37°C for 30 min. 

Thermodynamic analyses 

Ultraviolet (UV) absorbance was measured on a Shimadzu 1700 
spectrophotometer equipped with a temperature controller. 
Melting curves at 260 or 295 nm were measured in buffers 
containing 30 mM KCl, 40 mM Tris-HCl (pH 8.0 at 37°C), 
8 mM MgCl2, and 2 mM spermidine. Samples were heated at a 
rate of 0.5 or 0.1°C min⁻¹; melting curves of certain oligonucleotides 
differed at 0.1 and 0.5°C min⁻¹ (data not shown). 
Thermodynamic parameters (Tables 1 and S3 in File S1) were 
calculated using the average values obtained from curve fitting at 
the different DNA concentrations (2, 5, 10, and 20 µM). Before 
the measurements, the DNA samples were heated to 95°C, cooled 
to 0°C at a rate of −0.5°C min⁻¹, and incubated at 0°C for 
30 min. 
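
To illustrate how a stability value such as −ΔG°37 relates to fitted enthalpy and entropy, the sketch below assumes a simple intramolecular two-state transition (folded versus unfolded), for which ΔG°(T) = ΔH° − TΔS°, Tm = ΔH°/ΔS°, and the folded fraction is K/(1 + K) with K = exp(−ΔG°/RT). This is not the curve-fitting procedure used in the study. The function names and the ΔH° and ΔS° values are hypothetical.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal mol^-1 K^-1


def delta_g(dh, ds, temp_c=37.0):
    """Free energy of folding (kcal/mol) at temp_c: dG = dH - T * dS."""
    return dh - (temp_c + 273.15) * ds


def melting_temp_c(dh, ds):
    """Tm (deg C) of a unimolecular two-state transition, where dG = 0."""
    return dh / ds - 273.15


def fraction_folded(dh, ds, temp_c):
    """Folded fraction K / (1 + K), with K = exp(-dG / RT)."""
    t_kelvin = temp_c + 273.15
    k = math.exp(-delta_g(dh, ds, temp_c) / (R_KCAL * t_kelvin))
    return k / (1.0 + k)


# Hypothetical parameters: dH in kcal/mol, dS in kcal/(mol K).
dh, ds = -50.0, -0.15
print(f"-dG37 = {-delta_g(dh, ds):.2f} kcal/mol")
print(f"Tm = {melting_temp_c(dh, ds):.1f} C")
print(f"folded fraction at 37 C = {fraction_folded(dh, ds, 37.0):.3f}")
```

Under this assumption Tm does not depend on strand concentration, so comparing fits across the 2 to 20 µM range is one way to check whether a structure is intramolecular.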
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